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ABSTRACT 

Diagnosing cognitive errors possessed by examinees 
can be considered as a pattern classification problem that is 
designed to classify a sequential input of stimuli into one of 
several predetermined groups. The sequential inputs in this paper's 
context are item responses, and the predetermined groups are various 
states of knowledge resulting from misconceptions or different 
degrees of incomplete knowledge in a domain. In this study, the 
foundations of a combinatorial algorithm that will provide the 
universal set of states of knowledge will be introduced. Each state 
of knowledge is represented by a list of "can/cannot" cognitive tasks 
and processes (cognitively relevant attributes or latent variables) 
that are usually unobservable . A Boolean descriptive function is 
introduced as a mapping between the attribute space spanned by latent 
attribute variables and the item response space spanned by the item 
score variables. This function uncovers the unobservable content of a 
"black box." Once all possible classes are retrieved explicitly and 
expressed by a set of ideal item response patterns described by a 
"can/cannot" list of latent attributes, the notion of bug 
distributions and statistical pattern classification techniques will 
enable the accurate diagnosis of students' states of knowledge. 
Moreover, investigations on algebraic properties of these 
logically-derived-ideal-response patterns will provide insight into 
the structures of the test and dataset. There are 11 references and 
three illustrative tables. (Author/SLD) 
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ABSTRACT 



Diagnosing cognitive errors possessed by examinees can be 
considered as a pattern classification problem which is designed 
to classify a sequential input of stimuli into one of several 
predetermined groups. The sequential inputs in our context are 
item responses and the predetermined groups are various states of 
knowledge resulting from misconceptions or different degrees of 
incomplete knowledge in a domain. In this study , the foundations 
of a combinatorial algorithm that will provide the universal set 
of states of knowledge will be introduced. Each state of 
knowledge is represented by a list of "can/cannot" cognitive tasks 
and processes (called cognitively relevant attributes or latent 
variables) which are usually unobservable. A Boolean descriptive 
function will be introduced as a mapping between the attribute 
space spanned by latent attribute variables and the item response 
space spanned by item score variables. The Boolean descriptive 
function plays the role of uncovering the unobservable content of 
a black box. Once all the possible classes are retrieved 
explicitly and expressed by a set of ideal item response patterns 
which are described by a H can/cannot H list of latent attributes, 
the notion of bug distributions and statistical pattern 
classification techniques will enable us to diagnose students' 
states of knowledge accurately. Moreover , investigations on 
algebraic properties of these logically derived- ideal-response 
patterns will provide an insight into the structures of the test 
and dataset. 
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Introduction 

A typical pattern classification problem is to classify a 
sequential input of stimuli into one of several predetermined 
groups. The predetermined groups are considered, in our context, 
as latent classes which represent various states of knowledge and 
capabilities, and the stimuli are item response patterns. 
Tatsuoka (1983, 1985) introduced a cognitive error diagnostic 
model (called rule space) in which a student's response pattern to 
the items is classified into one of the predetermined latent 
classes. Each latent class consists of binary patterns that 
deviate from a given ideal response pattern by various numbers of 
slippages. Tatsuoka & Tatsuoka (1987) introduced the slippage 
probabilities and showed that such a class of response patterns 
follows a statistical distribution (called a Bug distribution). 
The ideal response pattern is the outcome of the perfectly 
consistent execution of some erroneous rule of operation or the 
response pattern corresponding to some state of knowledge and 
capabilities without errors of measurement. An error analysis or 
a task anlysis usually provides a list of erroneous rules of 
operations and/or various sources of misconceptions which are 
regarded as latent classes in this paper. However, it is 
important to have a systematic method for obtaining an appropriate 
list of ideal response patterns automatically. The method must be 
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applicable to any domain of interest. In this paper, such a 
method and the theoretical foundation of the method are 
introduced. The theoretical foundation is built upon algebraic 
relations between observable item patterns and latent score 
patterns of various cognitive tasks. Boolean Lattice theory is 
applied to develop the theoretical foundation of a test and data 
structure. 

An Incidence Matrix and Binary Scoring 

Suppose that the underlying characteristics of a domain of 
interest are well identified and involvement relationships between 
the latent attribute variables A^ k-l,...,K (also called 
cognitively relevant attributes) and items are coded by a binary 
matrix. The matrix is called an incidence matrix. Let the 
incidence matrix be a K x n matrix Q where K is the number of 
attributes and n is the number of items. The row vectors of Q, 
A k , k - 1,..., K indicate which items involve the attribute A k . 
Let latent variable Y k be the score of attribute task A*; that is, 
Y k - 1 if attribute ^ is correctly performed and Y k - 0, 
otherwise (if Ak is not a task, and the word "sctre" is not 
suitable, then Y k - 1 could signify -applicable", "belonging to" 
or any "affirmative adjective"). Let Xj be a score variable of 
item j and assume that X d takes the value 1 for the correct answer 
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and 0 for wrong answers. The relationship between latent- score 
pattern y - (Y lt ...Y K ) and the observable item score Xj is given 
by Equation (1) : 

Xj-JJ^ j«l ti (1) 

This equation implies that a response to item j will be correct if 
and only if latent scores Y k of attribute A k for - 1 are ail 
equal to 1. If any one of such latent scores is zero, then the 
item score Xj becomes zero. Needless to say, the meaning of 
Qkj - 0 and Y k - 0 should not be confused because Q^j is an 
involvement index of attribute A k to item j while Y k is the score 
of attribute task A*. The latent score pattern for item j sb.U 
be expressed by 

*j - (YiQij, Y 2 Q 2J YkQfcj) (2) 

where Z kJ - Y^j does not exist when Qy - 0. 

Further let us assume the conditional independence of latent 
variables Y k (k - 1.....K) and manifest variables Xj (j - l,...,n) 
for each performance level B. Let t be the total score of a 
latent score pattern y where we assume a special case, - 1 for 
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k - 1 K, and p k be the probability of attribute A* to be 

performed correctly, 



t - 



(3) 



Then the random variable t follows a binomial distribution if the 
attribute probabilities p k are the same for all k, and a compound 
binomial distribution if the probaMlities are different. 



Prob(t|0) = 



t 

r 

m-0 



E n p* Yk a-Pk) 1_Yk | 

SY k = m k - 1 1 



(A) 



The probability of getting the total score of K, or equivalently 
the pattern of all ones (1, 1, 1) is given by the last term of 
equation (A) . 

Prob(t « K|0) = JlPk (5) 



Let Sj be the total score of a latent pattern Zj for item j , 
then the relationships parallel to equations (4) and (5) for the 
variables s d , z d and the probabilities Pk are given by equations 
(6) and (7), respectively. 
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Prob(sj|0) 



sj 

E 

m»0 



^fjk " S J k-l 



(6) 



The probability of getting a particular pattern z i - 1 is given by 
equation (7) , 



Prob 



sj • £ Qui' 



(7) 



When item score Xj is not binary and the response to item j 
is scored by taking some partial knowledge into account, then the 
above discussion needs to be modified. 
An Inridence Matrix And Partial Credit Scoring 

The elements of a latent pattern Zj - (Z 1Jt Z 2J Z^) of 

item j can be replaced by integers or real numbers. Each element 
Zjtj can be the number of attributes which an examinee answered 
correctly or the weighted sum of the number of attributes answered 
correctly. That is: 



Qkj ■ i 



^kjZkJi 



(8) 
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where - 1 if and only if attribute A* is involved in item j 
and an examinee performed A^ correctly. 

When W kJ is equal to 1 for k - 1 K, then Xj becomes 

simply the number of correct attributes. The larger the Xj value 
is, the higher the level of performance is. Thus, graded response 
or partial credit models (Samej ima,F. f 1969; Masters.G.N. f 1982) 
can be applied. However, Z^, ke{Q kJ - 1} are usually not 
observable. If a multiple -choice item is constructed so as to 
have various subsets of scores of Z kJ for the alternatives, then 
it is possible to apply graded, partial credit or Polychotomous 
models. The partial credit model is formulated for situations in 
which ordered response choices are free to vary in number and 
difficulty from item to item. The restriction of the model is 
that tests are constructed with an ordered response format. 
Polychotomous models (Bock,R.D. ,1972) do not require the ordered 
response format and are applicable to multinomial response 
categories. 

When the weights are not 1, then the W kJ 's indicate that the 
quality of varies over the attributes. Some attributes are 
more difficult while others are less so. It is well known that 
there are ways to get the total score of Sj from s different 

attributes. Some combinations are cognitively more important than 
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others. It will provide us with useful information for 
constructing a good item pool for constructed response items or 
selection of distractors in multiple choice items. 
Lattice and Boolean Algebra 

In the previous section, the attributes were introduced as 
the row vectors of the incidence matrix Q and denoted by vectors 

A k , k-1 K. For example, let us consider the 3x5 incidence 

matrix shown below, where i x is are items and A lf A 2 , and A 3 

a,re attributes: 



A 3 



i\ -*2 *3 *5 



10 10 1 

0 10 0 1 
0 1111 



(9) 



There are three row vectors, A x - (1 0 1 0 1) , A 2 - (0 1 0 0 1) 
and Aj- (0 1 1 1 1). In other words, attribute A x is involved in 
items 1, 3 and 5, attribute A 2 is in items 2 and 5 and attribute 
A 3 is in 2, 3, 4 and 5. Therefore, the attributes can also be 
expressed by a set theoretical notation like A x - {1, 3, 5), 
A 2 - {2, 5) and A 3 - {2, 3, 4, 5). When we discuss the attributes 
in the context of set theory, the attributes are written in non- 
boldface capital letters as Aj,.., A K . If *n incidence matrix Q 
happens to be the identity matrix of order K - n, then A k contains 



U 



Boolean Algebra 
11 

a single item, and A* - (k). 

Let L be a set of subsets obtained from the set of K 
numbers, J - {1, 2,..., K). L will be a lattice and Boolean 
algebra. Lattice and Boolean algebra have been discussed in the 
field of abstract algebra and they have many interesting 
properties. They have been applied to digital computer systems 
and proved to be very useful in providing a simple and precise 
foundation for the analysis of combinatorial switching circuits. 
These properties will play a crucial role in achieving our goal 
which is to obtain the universal set of ideal response patterns 
(or all the possible states of knowledge and capabilities) 
obtainable from a given incidence matrix. Let us start from the 
definition of a lattice. 

Definition 1 A set of sets L is said to be a lattice if two 
binary compositions u and 0 are defined on its subsets (called 
elements hereafter) and they satisfy the following relations: 

11 A U B - B U A, A O B ■ B fl A 

1 2 (A U B) U C - A U (B U C) , (A O B) O C - A O (B O C) 

1 3 A U A - A, A n A - A 

UA U B) n A - A, (A n B) U A - A 
The above conditions are equivalent to saying that a lattice is a 
partially ordered set in which any two elements have a least upper 
bound and a greatest lower bound. The l.u.b. and g.l«b. of any 
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elements A and B in L are given by the union and intersection, 
A u B, and A n B, respectively. Similarly, (A u B) u C is the 
l.u.b. of A, B, C and (A n B) n C is the g.l.b. The order £ in L 
is defined by Definition 2: 

Definition 2 For any pair of elements A and B in L, A & B if and 
only if A u B - A or A n B - B. 

Definition 2 provides us with an equivalent condition for L 
to be a lattice. This order satisfies the asymmetric (if A i B 
and B i A then A - B) and transitivity laws (if A i B and BiC 
then A £ C) , thus L becomes a partially ordered set. Let us 
further define I and 0 as follows: 

K K 

I * IK and 0-fjAk (10) 

then I and 0 belong to L. If the distributive law, 

A n (B u C) - (A n B) u (A n C) (11) 

is satisfied, then L is called a modular lattice. The modular 

condition has an alternative definition: if A S B and 

A U C - B U C and AnC-BnC for any C in L, then A « B. 

The third important operation is complementation. 
Definition 3 The complement A' of A is defined by A'U A - 1 and 
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A'n A - 0. 

For example, the lattice of a set of subsets is complemented 
if the complement of a subset A is the usual set- theoretic 
complement- -that is, the elements of J that do not belong to A. 
Definition 4 A Boolean algebra is a lattice with 1 and 0, the 
distributive law and complementation. 

Definition 4 implies that our lattice L is also a Boolean 
algebra. The most important elementary properties of complements 
in a Boolean algebra may be stated as follows: 

Theorem 1 The complement A # of any element A of a Boolean Algebra 
L is uniquely determined. The mapping A -> A' is one to one, onto 
itself. Then the mapping satisfies conditions 1 and 2: 

1. (A')' - A 

2. (AUB)' - A' n B' and (A n B)' « A' U B' 
The proof may be found in Birkoff (1970). 

A Boolean algebra becomes a Ring with the two operations + and x 
where + is the union set of A and B and x is the intersection of A 
and B. 

Definition 5 For A and B in L, the addition + of A and B is 
defined byA+B-AuB and the product x is defined by A x B - A 
n B. Thus L becomes a Ring* 

It is obvious that L satisfies commutative laws, associative 
laws, the identity laws A + 0 - A, A x I - A, and Idempotent law 
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A + A « A with respect to the new operations + and x. 
Distributive law is also satisfied. In summary, 

1. A + B- B + A, AxB-BxA 

Commutative Laws 

2. (A + B)'-A' xB' t (AxB)'-A' + B' 

Complementation 

3. (A + B) + C - A + (B + C) , (A x B) x C - A x (B x C) 

Associative Laws 

4. A+0-O+A-A, AxI-IxA-A 

Identity 

5. A+A-A, AxA-A 

Idempotence 

6. (A+B)xC-AxC+BxC 

Distributive Law 

The relationship between the attribute vectors 
Ak (k - 1» . . . , K) and the Ring L just introduced will be 
clarified. 

Attribute Response Space and Item Response Space 

When an incidence matrix is the identity matrix of order 
K, then A k will be the unit vector e k - (0,.., 1, 0,.., 0), whose 
k-th element is 1 and the other elements are zero* A Boolean 
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lattice L will then consist of a set of attributes where attribute 
A k corresponds one-to-one to item k or equivalently to e k . 
Therefore, L can be considered as a set of sets of items, or 
equivalently as a set of sets of e k s. In order to distinguish 
between these two sets, the set of sets of attributes is denoted 
by the same notation, L and the set of sets of items (or sets of 
e k ) is denoted by RL, in other words Boolean Algebra of Item 
Response Patterns. Both L and RL are K-dimensional spaces since 
the incidence matrix is the identity of order K. If an incidence 
matrix is not the identity then RL X which associates with a non- 
identity incidence matrix becomes a subspace of RL. It is very 
difficult, in practice, to construct an item-pool whose incidence 
matrix is the identity. Each item in the identity- incidence 
matrix must contain one and only one attribute. It is very common 
that an item involves several attributes and two different items 
usually involve two different sets of attributes. In practice 
most incidence matrices are usually more complicated than the 
identity matrix and their columns and rows contain several ones in 
a variety of cells. 

In the earlier example of 3 x 5 matrix, A x - (1 0 1 0 1) 
corresponds to set A x - {1, 3, 5} ; A 2 - (0 1 0 0 1) to 
A 2 - {2, 5); and A 3 - (0 1 1 1 1) to A 3 - {2, 3, 4, 5). The union 
set of A x and A 2 , {1, 2, 3, 5} corresponds to the addition of 
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Ax +A 2 -(11101) in terms of elementvise Boolean addition. 
Boolean addition is defined by 1+1-1, 1+0-1, 0+1-1 and 
0 + 0-0. The intersection of A x and A 2t {5} corresponds to the 
product of A l x A 2 - (0 0 0 0 1) in terms of elementvise Boolean 
multiplication of 0 and 1. Boolean multiplication follows the 
rules, 0x0-0, 1x0-0x1-0 and lxl-1. It is clear 
that these operations satisfy the above relations 1 through 6. 
The complement of A k is A' k whose elements are obtained by 
switching each element of A* to the opposite; thus complement of 
A x is (0 1 0 1 0), A' 2 is (1 0 1 1 0) and A' 3 is (1 0 0 0 0). It 
is also clear that A* + A' k is equal to 1. 

Suppose A k , k - lf f K are the row vectors of such a 
general incidence matrix, and let RL X be a set of sets of the 
attribute vectors. Then RL X becomes a sublattice of RL which is 
derived from the set of all the response patterns* A subset RL X 
of RL is called a sublattice if it is closed with respect to the 
binary compositions n and u. Further Theorem 2 shows that RL X 
becomes a subring of RL also. 

Theorem 2 A set RL X of sets of row vectors of an incidence matrix 
Q is a Boolean algebra with respect to elementwise Boolean 
addition and multiplication of 0 and 1. 
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Boolean addition and multiplication satisfy the following: 

1 0 + 0-0 

2 1 + 1-1 

3 0 + 1-1 + 0-1 

4 0x0-0 

5 Oxl-lxO-0 

6 lxl-1 

7 (0+1) x0«lx0-0& (0x1) +0-0+0-0 

8 (0+l)xl-lxl-l& (0x1) +1-0+1-1 

9 (0 + 1) + 0 - 0 + (1 + 0) & (0 x 1) x 1 - 0 x (1 x 1). 



Further RL X satisfies 0' - 1 and hence 1' - 0. So RL X is a 
Boolean algebra. For any elements of RLj, Ak + A L is defined by 
elementwise Boolean operations of + and x. Then, any elements A k 
and Aj. of RL X satisfy the lattice conditions given below: 



lj A k + A t - Aj. + Ak & Afc x A]_ - A]_ x Afc 

1 2 (A k + A L ) + A,,, - Ak + (A L + A,,,) & 

Ak x A L ) x An - Ak x (A L x AJ 

1 3 Ak + Ak-Ak &A k xAk-Ak 

1* (A k + A L ) x Ak - Ak & (A L x AJ + A L - A L 

K K 

Let us define 0 - TT and 1 " E K\ then the complement A k ' 
- k-l ~ k-l 
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is defined by Ak + A*' - 1 and ^ x A^' - 0 with elementwise 
Boolean operations of + and x. ' The distributive laws are also 
satisfied from properties 7 and 8, that is A* x (A x + An) - A* x Ax 
+ A* x An. Therefore RL X becomes a Boolean algebra. In the 
example of our 3x5 incidence matrix, the elements 0 and 1 are 
given by 0 - (0 0 0 0 1) and 1 - (1 1 1 1 1) . 
Example : 

10 10 1 
0 10 0 1 
0 1111 

K 

0 " II A k - (0 0 0 0 1) , and 1 - £ A* - (11111), 

k-l - k-1 

V(010 11), VdOlll), and A 3 - (1 0 0 0 1). 

Several properties of RLx are introduced below: 
Property 1 RLj is a subset of all possible response patterns and 
is closed with respect to the Boolean operations. 
Property 2 If Q is the n x n identity matrix, then RLj - RL. 
Property 3 If Ak > A l then Ak + A x - Ak and Ak x A x - A x . 
Example: Since A 3 £ A 2 , A 2 + A 3 - A 3 and A 2 x A 3 - A 2 . 
Property 4 If A* Ax then Ak' < A x ' , (Ak + A x ) ' - Ak' and 
(Ak x A x )' - A x '. 

Property 5 If Q is a Kxn lower triangle matrix (or Guttman scale 
matrix) then RL X consists of K row vectors. 
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If Q is a Guttman scale matrix, then the row vectors are 
totally ordered, Ai ^ A 2 <. . .<, Ak . For any k and 1, with k 1, 
A k + A L - Ak and Ak x Ai - k t from Property 3. Moreover, the 
identity 1 will be A* and the null element 0 will be A x . 

Incidence matrices having this form are often seen in 
attitude tests where measures are coded by ratings. Models such 
as Samejima*s graded response model or Masters' partial credit 
model will be suitable to this form of incidence matrices. These 
models were developed to measure an ordered trait. For such a 
trait, linearly ordered levels or categories within an item exist. 

As a hypothetical example, suppose there are three items: 

1) Add 2/3 and 2/3, then reduce the answer to its simplest 

form, 

2) Add 1/3 and 1/3, and 

3) What is the common denominator of 1/3 and 1/5? 
Then the attributes are: 

A x : Simplify to the simplest form, 
A 2 : Get the numerator, and 
A-,: Get the denominator. 
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The incidence matrix is: 



Q = 



A 2 
A3 



item 1 item 2 item 3 

10 0 
110 
111 



level 1 
level 2 

level 3 



can do 

can do A 3 and A 2 
can do all 



Thus, scores for the levels will be 1, 2, and 3, respectively. 
Property 6 The complement of the sum of A* and A x with respect to 
Boolean addition is the product of the complements of A* and A lt 
(A k + h{) 9 - A' k x A'x. 

Property 7 The complement of the product of A* and Ai with 
respect to Boolean product is the sum of the complements of Ak and 
A lt i.e. , (A* x A x )' ~ A k ' + A]/ . 

Definition 6 A chain is a subset of RLi in which all the elements 
are totally ordered with respect to > or 

Since RLi is a partially ordered set, (and so are L, L} and 
RL) there are usually more than one chain. The order relation is 
not applicable to two different elements coming from two different 
chains. Moreover, two chains may contain the same elements in 
common. Therefore, a tree graph can be drawn by connecting the 
elements in the chains (Tatsuoka & Tatsuoka, 1990). 

The next section introduces a new function by which the 
universal set of ideal response patterns (or all possible states 
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of knowledge) is obtainable from an incidence matrix, and gives 
their descriptive meanings. The description of states are given 
by a list of combinatorial "can/cannot" attributes. 
Boolean Description Function: Determination of Ideal Response 
Patterns As Error-Free States of Knowledge and Capabilities 

There are several interesting relationships between the K- 
dimensional unit vector e k and A*. 

Property 8 The unit vector e k of the latent variable space 
uniquely corresponds to attribute vector A k and the Boolean sum of 
e k# £ e k corresponds to the sum of A*, £ A*. Similarly, the 

k k 

Boolean product of the elements of e k , fj e k uniquely corresponds 

k 

to that of A k , Yl Ait* 
k 

Since our goal is to draw some inferences about latent score 
patterns Y from observable information of item response patterns, 
it is necessary to introduce a series of hypotheses which convert 
the latent-but-interpre table information into observable-and- 
interpre table information. The observable information in our 
context is obtainable only from item responses and we do not 
assume observable information from the latent scores. 
Definition 7 A hypothesis H k is the statement that "one cannot do 
attribute A k correctly but can do the remaining of tl i 
attributes. " H k will produce the item pattern which is the 
complement of A* and represent an interpretable state of 
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knowledge . 

It is clear that if a student cannot perform A* correctly 
but can do the remaining of the attributes right, then the items 
involving attribute A* will have the score of zero but the items 
not involving A* will get the score of 1*. The mapping function, 
A k -> Ak' that takes the complement is equivalent to applying the 
hypothesis H k . 

Property 9 Taking Hypothesis H k is equivalent to taking the 
complement of A* and is denoted by A'*. 

Property 10 The hypothesis H kl+>>>+k i is "one cannot do any of the 
attributes A kl , A k2 ,..,, A kl correctly but can do the rest of the 
attributes" . 

Taking the hypothesis H ku ...^i is equivalent to taking the 
complement of the addition of Aju, . . . f A kl i.e. , 

(A kl +. . .+ A^)'- A k » x x. . .x A k »i. (12) 

The item pattern will be 

Xj - 0 if Q kJ - 1 if there is at least one k in the set 
(kx, k 2 k L ) 

Xj - 1 If Q kJ - 0 for all k in the set {k x , k 2 , . . . , kj . 
As an example, we use the incidence matrix of order 3x5 given on 
p. 11. Table 1 shows various hypotheses and their descriptive 



Boolean Algebra 
23 

outcomes and resulting ideal item patterns. 

Table 1 to be inserted about here 

The description of n cannot/can n in the second column of 
Table 1 corresponds to the latent-score patterns of y's given in 
Table 2. The hypothesis defined in Proerties 9 and 10 and 
Equation (13) provide us with a mapping between attribute patterns 
and ideal item patterns. 

Table 2 to be inserted about here 

This mapping is a Boolean function which plays the role of 
uncovering the contents of a black box. In our situation, latent 
scores on the attributes become observable via this Boolean 
function. 

Definition 8 The mapping f from the attribute response space to 
the item response space is called a Boolean Description Function. 
The Boolean descriptive function f satisfies the following 
property: 

Property 11 For Boolean Description Function f and e k , 

1 f(e k ') - V 

2 f((e x + e k )') - f<e x ' x e k ') - A x ' + V 
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3 f(0) - £ V 

k«l 

4 f(I) - I 

K K K K 

Note that TT An-O, but ( JT Aj J) A k ' + I because £ A*-I 

K ftl k«l k«l k.l 

but 52 A k V 1 in RL 1* 
k-1 

Since RL X is a Ring, it is natural to consider the 
hypotheses that involve interactions of two or more attributes. 
Property 12 The hypothesis H klXffXki is that "one cannot do 
attributes A kl ,...A kl when all of them are involved in a single 
item but can do each separately and can do the remaining 
attributes". This hypothesis corresponds to 

(A kl x. . .x Aia)' - A k »x +. . .+ A k .].. (14) 

The item pattern will be 

Xj - 0 if Qkij - Qk2j -•••Qkij " 1 

Xj - 1 if there is at least one k t such that Q^j - 0 for 
kt, t - 1, 2 1. 

The hypotheses of interactions, (14) also produce the ideal item 
patterns that can be characterized by "can/cannot attributes". 

Insert Table 3 about here 

In our situation, the latent score patterns of the 
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attributes become observable via the Boolean description function. 
If the attribute response space is considered as a linear vector 
space of Y, then the ideal item response patterns generated by the 
hypothesis introduced in Property 10 will be sufficient to 
describe students 9 states of knowledge and capabilities. But as 
can be seen in Table 3, KL 1 contains other ideal item patterns 
generated by the hypotheses H klxk2 x...xki which involve the 
interaction of latent score Ys. These patterns do not correspond 
to the latent attribute score patterns Y in the linear vector 
space spanned by the e k s. For example , the ideal item response 
pattern corresponding to the interaction of attribute scores 
Yi x Jz is produced by Hyx x y2 - In other words, the ideal pattern 
corresponding to the interaction y x x y 2 contains 0s only for the 
items that involve both the attributes A x and A 2 . Since the 
current test theories such as Item Response Theory models require 
the assumption of conditional independence of item responses , they 
may not be applicable to the dataset obtained through the 
hypothesis of the interaction of scores (yiy 2 , or Xj X 2 ) . We will 
restrict the scope of this study to the linear hypothesis of 
Hki+...+ ki> which requires only linearity of y. 

The Boolean description function f is not a one-to-one 
function. As can be seen in Table 1, hypotheses H 3 and H 2 + 3 yield 
the identical item pattern (1 0 0 0 0), and so do H 0 and H 1+3 . The 
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Ideal item patterns resulting from application of two different 
hypotheses may not be always different , and indeed there is a 
systematic relation when two hypotheses produce the same ideal 
item pattern. Property 3, needless to say, implies that any 
element smaller than A* with respect to the order 2 in Lx 
" degenerates M so that addition of A k and Ax becomes A*. That is, 
A x + A k - A k if A k £ A x . Similarly, x Ai - A x if A k 2: A x . 

A special example that is affected by this degenerative 
property is the incidence matrix of Guttman type. This type of 
incidence matrix produces K elements consisting of the original 
row vectors because the row vectors become a single chain of 
length K. The 3x5 incidence matrix used as example above often 
has two chains, A 3 , and A x & A 2 « The distinct elements will be A 3 
and Ax , A2 » 

A 3 + Ai - (1 1 1 1 1), A 2 + A x - (1 1 1 0 1), 
A 3 x A x - (0 0 1 0 1), A 2 x Ax - (0 0 0 0 1). 

Let us introduce an important definition that will be useful 
for determining the number of elements in RLj. 
Definition 9 An element A of Lx is an atom if there are no 
elements between A and 0, or equivalently if A 3 B and A - B f 
imply B - 0. 

Property 13 Atoms in L x can be generated by 
A, - [n^lfl (n^f for a11 P ossible subsets s of 
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J - {1, 2 K) . Or, equivalently A, - fl *k fl A*)' • 

A s s are prospective atoms and some of them may be equal to 0. The 

intersection of two different atoms is 0: A* n Ai - 0. 

Property 14 Any element B of L x can be written as B - I ) A k 
' ke'M 

where A* are atoms and M is an index set. 

Examples of Properties 5 and 6 are illustrated with our 
familiar 3 x 5 Q matrix. Let us consider the index set (1, 2, 3). 
Its non-empty subsets are {1}, {2}, {3}, (1, 2), {1, 3), {2, 3} 
and {1, 2, 3). Then 

a x - A x n (A 2 n A 3 )'- (1 0 0 0 1) 
a 2 - A 2 n (Aj n A 3 )'- (0 0 0 0 1) 
a 3 - A 3 n (Aj n A 2 )'- (0 0 0 1 1) 
a 12 - Ax n A 2 n A 3 '- (0 0 0 0 1) 
a 13 - Ai n A 3 n A 2 '- (00101) 
a 23 - A 2 n A 3 n A x '- (0 1 0 0 1) 
a 123 - Aj n A 2 n A 3 - (0 0 0 0 1) 
As can be seen in the above examples, there are four atoms a x , a 3 , 
a 13 , and a 23 while a 2 , a 12 and a 123 are degenerated to the 0 
element of I*. The original row vectors are written as follows: 
Ai - a x + a 13 
A 2 - a 23 

A 3 - a 23 + ai 3 + a 3 . 
Since every element in RLj is expressed by a combination of atoms, 
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there are 2 4 - 16 elements in RL X . In general, any element in KL X 
is written by a linear combination of the atoms that are linearly 
independent. The number of the atoms will determine the number of 
elements in RL. The atoms are usually not interpretable unless a 
test has the identity incidence matrix. The attributes in an 
identity incidence matrix are atoms. 
Summary and Discussion 

Tatsuoka (1990) discussed an incidence matrix Q that is an 
indication matrix of item characteristics with respect to the 
underlying cognitive processes which are involved in each item. 
These cognitive tasks are called cognitively relevant attributes 
in this study. An advantage of expressing the underlying item 
characteristics explicitly in matrix form is a tremendous benefit: 
First, it enables us to use a variety of scoring methods such as 
right or wrong, graded scores, or partial credit scores. Second, 
it enables us to apply powerful mathematics to investigate 
systematically a variety of relationships among the unobservable 
attributes, between the attributes and the items. Third, it 
enables us to help examine the structure of a test with respect to 
the underlying cognitive tasks. 

Since a set of sets of attributes is a Boolean algebra 
(Boolean Algebra has been used widely in the theory of 
combinatorial circuits of electricity and electronics), 
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unobservable performances on the attributes are viewed as 
unobservable electric current running through various gates if 
they are open. An open gate corresponds to an attribute that is 
answered correctly , and a closed gate to wrong answers* All the 
gates in a circuit must be open so that the current goes through 
it. An item can be answered correctly if and only if all the 
attributes involved in the item can be answered correctly. This 
is an intuitive analogy between the electricity and electronics 
and cognitive processes of answering the items , but Boolean 
Algebra used for explaining various properties of electricity and 
combinatorial circuits can be applied to explain the underlying 
cognitive processes of answering the items. 

The theoretical foundation of relationships between 
observable item response patterns and unobservable responses on 
the attributes which are cognitively relevant to the items is 
given in this study also. A newly defined Boolean descriptive 
function f plays the role of a link between underlying cognitive 
processes of test items and all the response patterns of these 
items. Since the model does not expect that responses on the 
attributes are observable , measures of performances on the 
attributes can not be obtained directly. However > the Boolean 
descriptive function converts unobservable states of knowledge and 
capabilities into a set of observable item patterns which are 
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called ideal item patterns that are free from measurement errors. 
The states of knowledge and capabilities are represented by a list 
of "can/cannot" attributes. The increase of the numbers of states 
is combinatorial, but Boolean algebra provides us with 
mathematical tools to overcome the problem of a combinatorial 
explosion. 

Once a list of predetermined groups or states of knowledge 
and capabilities is determined by a software called "BUGLIB" based 
on this study, then the notion of "bug distribution" (Tatsuoka and 
Tatsuoka, 1987; Tatsuoka, 1990) and statistical pattern 
classification techniques (Tatsuoka, 1985; Lachenbruch, 1975) will 
enable us to diagnose students' states of knowledge accurately. 

Finally, we conclude the study with an important implication 
for modern test theory. An incidence matrix implicitly indicates 
that the attribute scores y-(Y x ,Y 2 , . . . ,Y K ) satisfy local 
independence by a given performance level if we assume local 
independence at the item level. The Item Response Theory models 
are built upon this conditional independence of performance level 
theta. However, the Boolean algebra of a set of sets of response 
patterns is also a Ring, so it permits us to consider the states 
of knowledge and capabilities derived from the interaction of 
attribute scores. The Boolean descriptive function generates the 
ideal item patterns corresponding to the states determined by 
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using interaction of attributes. Such errors states have been 
observed in many studies of "bug analysis" (Brown and Burton, 
1978; Tatsuoka, 1984). A new model that does not assume local 
independence will be needed in the future. 



Boolean Algebra 
32 



References 

Birkoff, B., (1970). Lattice Theory . MacMillan, New York. 
References 

Bock, R. D. (1972). Estimating item parameters and latent ability 

when the responses are scored in two or more nominal 

categories. PsYchoroetrika . 439-459. 
Masters, G. N. (1982). A Rasch Model for partial credit scoring. 

Psvchometrika . 149-174. 
Samejima, F. (1969). Estimation of latent ability using a response 

pattern of graded scores. Psychometrika . Monograph 

supplement. 24(4), Part 2. 
Tatsuoka, K. K. (1990). Toward an Integration of Item-Response 

Theory and Cognitive Error Diagnosis. In N. Fredericksens , 

Glaser, Lesgold and Shafto (Eds.) Diagnostic monitoring of 

skill and knowledge acquisition . Lawrence Erlbaum, 

Hillsdale , New Jersey. 
Tatsuoka M. M. , & Tatsuoka K.K. (1988). Rule Space. In Kotz- 

Johnson (Eds.) Encvclodia of Statistical Sciences . 8. John 

Wiley & Sons. 

Tatsuoka, K. K. , & Tasuoka, M. M. (1987). Bug distribution and 
pattern classification. Psvchometrika . £2(2) , 193-206. 



3K 



Boolean Algebra 
33 

Tatsuoka, K. K. , & Tatsuoka, M. M. (1987). Item response theory. 

Latent Classes and Rule space . (Research Report 87-2-ONR, 

Final Report). Urbana, IL: University of Illinois, CERL. 
Tatsuoka, K. K. (1985). A probabilistic model for diagnosing 

mosconceptions in the pattern classification approach. 

Journal of Educational Statistics . 12(1). 55-73. 
Tatsuoka, K. K. (1984). Analysis of errors in fraction addition 

and subtraction problems . (Final Report for Grant NIE-G- 

0002). Urbana, IL: University of Illinois, CERL* 
Tatsuoka, K. K. (1983). Rule space: An approach for dealing with 

misconceptions based on item response theory. Journal of 

Educational Measurement . £0(4), 345-354. 



9 

ERLC 



37 



Acknovl e dgement 



This research was sponsored by the Cognitive Science 
Program, Cognitive and Neural Science Division, Office of Naval 
Research, under Grant No. N00014-90-J-1307, R&T 4421559. The 
author is grateful to Sung-Ho Kim, Peter Pashley, Charles Lewis 
and Maurice Tatsuoka for valuable discussions, comments and 
suggestions, June Quattromani for typing the manuscript. 



Boolean Algebra 
34 

Table 1 

Boolean Descriptive Function: Case of Linear Hypothesis 



Hypothesis Interpretation Ideal Response Pattern 

H 0 can do everything (11111) 

Hi cannot A lt can A 2 , A 3 (01010) 

H 2 cannot A 2 , can A lt A 3 (10 110) 

H 3 cannot A 3 , can A lt A 2 (1 0 0 0 0) 

H 1+2 cannot A lt A 2 , can A 3 (0 0 0 1 0) 

H 1+3 cannot A lt A 3 , can A 2 (0 0 0 0 0) 

H 2+3 cannot A 2 , A 3 , can A x (1 0 0 0 0) 

H 1+2+3 cannot A lt A 2 , and A 3 (0 0 0 0 0) 
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Table 2 

Correspondence Between Latent Attribute Space and Item Space 



Hypothesis Attribute Score Item Score 

H 0 (111) (1111 1) 

H x (Oil) (0101 0) 

H 2 (10 1) (10 11 0) 

H 3 (110) (10 0 0 0) 

H 1+2 (0 0 1) (0 0 0 1 0) 

H 1+3 (0 1 0) (0 0 0 0 0) 

H 2+3 (1 0 0) (1 0 0 0 0) 

H 1+2+3 (0 0 0) (0 0 0 0 0) 
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Table 3 

Boolean Descriptive Function: Case of Interaction 
Hypothesis Interpretation Ideal Response Pattern 

H2z3 Cannot A 2 and A 3 (10 110) 

together, can Ax 
H lx3 Cannot A x and A 3 (110 10) 

together , can A 2 
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